The Generation of Regional Pronunciations of English for Speech Synthesis1
نویسنده
چکیده
Welsh and Northern English), and two American ones (New York and South Carolina, to represent Eastern and Southern American); regional features were based primarily on the descriptions in [1], with native-speaker input where possible. The regional accents are abbreviated in this paper as: Br(Sc) = Edinburgh; Br(W) = Cardiff; Br(N) = Leeds; Am(E) = New York; and Am(S) = South Carolina. For the standard accents, Br(RP) = RP, and Am(Gen) = General American. Most speech synthesisers and recognisers for English currently use pronunciation lexicons in standard British or American accents, but as use of speech technology grows there will be more demand for the incorporation of regional accents. This paper describes the use of rules to transform existing lexicons of standard British and American pronunciations to a set of regional British and American accents. The paper briefly discusses some features describes of the regional accents in the project, and the framework used for generating pronunciations. Certain theoretical and practical problems are highlighted; for some of these, solutions are suggested, but it is shown that some difficulties cannot be resolved by automatic rules. However, although the method described cannot produce phonetic transcriptions with 100% accuracy, it is more accurate than using letter-to-sound rules, and faster than producing transcriptions by hand. The accents generated represent fairly educated regional speech, though some optional rules were included which produce broader accents. The division between 'obligatory' and 'optional' rules is somewhat artificial, as there may be speakers from the region who have a noticeably local accent but do not use all of the 'obligatory' rules as their speech is somewhat closer to the standard accent. However, it enables us to produce pronunciation lexicons which represent the main features of the regional accents, while allowing some freedom of variation.
منابع مشابه
The generation of regional pronunciations of English for speech synthesis
Welsh and Northern English), and two American ones (New York and South Carolina, to represent Eastern and Southern American); regional features were based primarily on the descriptions in [1], with native-speaker input where possible. The regional accents are abbreviated in this paper as: Br(Sc) = Edinburgh; Br(W) = Cardiff; Br(N) = Leeds; Am(E) = New York; and Am(S) = South Carolina. For the s...
متن کاملLexical and Acoustic Adaptation for Multiple Non-Native English Accents
This work investigates the impact of non-native English accents on the performance of an large vocabulary continuous speech recognition (LVCSR) system. Based on the GlobalPhone corpus [1], a speech corpus was collected consisting of English sentences read by native speakers of Bulgarian, Chinese, German and Indian languages. To accommodate for non-native pronunciations, two directions are follo...
متن کاملAutomatic Pronunciation Dictionary Generation from Wiktionary and Wikipedia
In this work we show that dictionaries from the World Wide Web which contain phonetic notations may represent a good basis for the rapid pronunciation dictionary creation within the speech recognition and speech synthesis system building process. As a representative dictionary, we selected wiktionary.org [1] since it is available in multiple languages, and in addition to the definitions of the ...
متن کاملL2 English learners' recognition of words spoken in familiar versus unfamiliar English accents
How do L2 learners cope with L2 accent variation? We developed predictions based upon the Perceptual Assimilation Model-L2 (PAM-L2) and tested them in an eye-tracking experiment using the visual world paradigm. L2-English learners in Australia with Chinese L1 were presented with words spoken in familiar Australian-accented English (AusE), and two unfamiliar accents: Jamaican Mesolect English (J...
متن کاملAutomatic generation of multiple pronunciations based on neural networks
We propose a method for automatically generating a pronunciation dictionary based on a pronunciation neural network that can predict plausible pronunciations (alternative pronunciations) from the canonical pronunciation. This method can generate multiple forms of alternative pronunciations using the pronunciation network. For generating a sophisticated alternative pronunciation dictionary, two ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997